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ABSTRACT 

Maximal Independent Set selection is a fundamental prob- 
lem in distributed computing. A novel probabilistic algo- 
rithm for this problem has recently been proposed by Afek 
et al, inspired by the study of the way that developing cells 
in the fly become specialised. The algorithm they propose 
is simple and robust, but not as efficient as previous ap- 
proaches: the expected time complexity is 0(log 2 n). Here 
we first show that the approach of Afek et al cannot achieve 
better efficiency than this across all networks, no matter how 
the probability values are chosen. However, we then propose 
a new algorithm that incorporates another important feature 
of the biological system: adapting the probabilities used at 
each node based on local feedback from neighbouring nodes. 
Our new algorithm retains all the advantages of simplicity 
and robustness, but also achieves the optimal efficiency of 
O(logn) expected time. 

1. INTRODUCTION 

One of the most fundamental problems in distributed com- 
puting is to choose a set of local leaders in a network of con- 
nected processors so that every processor is either a leader or 
connected to a leader, and no two leaders are connected to 
each other. This problem is known as the maximal indepen- 
dent set (MIS) selection problem and has been extensively 
studied [16] [15] [3) [13} (HJ [T9J [18]. 

Different maximal independent sets for the same network 
can vary greatly in size. In contrast to the MIS selection 
problem, the related problem of finding a maximum size 
independent set (MaxIS) is notoriously hard. It is equiv- 
alent to finding a maximum clique in the complementary 
network, and is therefore NP-hard [12] (and hard to ap- 
proximate [lO] ). However, computing an arbitrary MIS (not 
necessarily of the maximum possible size) using a centralised 
sequential algorithm is trivial: simply scan the nodes in arbi- 
trary order. If a node u does not violate independence, add 
u to the MIS. If u violates independence, discard it. Hence 
the real challenge is to compute such an MIS efficiently in a 



distributed way with no centralised control. Here we present 
a new distributed approach to this problem which achieves 
optimal efficiency. 

Afek et al. have recently pointed out the similarity between 
the MIS selection problem and neural precursor selection 
during the development of the nervous system of the fruit 
fly Drosophila [2j. During development, certain cells in the 
pre-neural clusters of the fly specialise to become sensory 
organ precursor (SOP) cells, which later develop into cells 
attached to small bristles (microchaetes) on the fly that are 
used to sense the environment. During the first stage of this 
developmental process each cell either becomes an SOP or 
a neighbour of an SOP, and no two SOPs are neighbours. 
These observed conditions are identical to the formal re- 
quirements in the maximal independent set selection prob- 
lem (see Figure [T]). 

Afek et al. also pointed out that the method used by the 
fly to select the SOPs appears to be rather different from 
standard known algorithms for choosing an MIS 16, 3j |18| . 
These algorithms rely on arithmetic calculations and precise 
numerical comparisons, and generally require explicit infor- 
mation about the number of active neighbours that each 
node in the network currently has. They also generally rely 
on exchanging complex messages representing precise nu- 
merical information. By contrast, the cells of the fly appear 
to solve the problem without clear central control using only 
simple local interactions between certain membrane-bound 
proteins, notably the proteins Notch and Delta pi [7j [8] . 

Afek et al. compared statistics derived from the observed 
SOP selection times with several in silico models for stochas- 
tic accumulation of Notch and Delta. They finally con- 
structed a consistent model with stochastic rate change that 
did not require knowledge about the number of active neigh- 
bours and used only threshold (binary) communication. Based 
on this stochastic rate change model, they proposed a new 
general-purpose distributed algorithm for solving the MIS 
selection problem in an arbitrary network [2]. 

The algorithm proposed by Afek et al. is fully synchronous 
and operates over discrete time steps. At each step, each 
node may choose, with a certain probability (that varies over 
time), to signal to all its neighbours that it wishes to join the 
independent set. If a node chooses to issue this signal, and 
none of its neighbours choose to do so in the same time step, 
then it successfully joins the independent set, and becomes 
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Figure 1: (A) An MIS selected from a random undirected graph with 20 nodes. The set of vertices 
^3,^11,^13,^16,^17 is an MIS because no two nodes in this set are adjacent, and no further node of the graph 
can be added to this set without violating this property. (B) The selection of SOP cells in the fly appears to 
be formally similar to an MIS selection proble: each cell either becomes an SOP or a neighbour of an SOP, 
and no two SOPs are neighbours. 



inactive, along with all its immediate neighbours. However 
if any of these neighbouring nodes issue the same signal at 
the same time step, then the cell does not succeed in joining 
the independent set at that step. This process is repeated 
until all nodes eventually become inactive. This algorithm 
is remarkably simple: it requires no knowledge about the 
number of active neighbours and uses only one-bit messages. 
The computation at each individual node can be described 
by a simple automaton, as shown in Figure [2] 

As originally presented [2], the algorithm uses a sequence 
of gradually increasing global probability values calculated 
from the total number of nodes of the graph and its maxi- 
mum degree. The algorithm was further refined by Afek et 
al. in a later paper 1 . In the later version the probability 
values are chosen according to a fixed pattern, so that the 
individual nodes require no knowledge about the size of the 
graph or its maximum degree. 

However, in both versions the new approach has a major 
drawback: the expected number of time steps required is 
(9(log 2 n). This makes the algorithm usable, but not as fast 
as previous algorithms. The most well-known distributed 
algorithm for maximal independent set selection is the el- 
egant randomized algorithm of [5J [l6], generally known as 
Luby's algorithm, which has an expected running time which 
is O(logn). Several other previous algorithms have also 
achieved an upper bound of O(logn) on the expected num- 
ber of steps required to compute a maximal independent set, 
and this has been shown to be the best possible bound that 
can apply for all networks [18] . 

To investigate the running time in practice, we implemented 



the refined version of the algorithm, described in fil, where 
the probabilities are chosen by repeatedly sweeping across 
a wider and wider range of different values. Following the 
scheme specified in 1 , we divide the computation into phases, 
numbered 1, 2, 3, . . .. Each phase k consists of k + 1 time 
steps. The value of the probability p varies as follows: dur- 
ing each phase k the value of p is 1 initially, and gets halved 
after each successive time step in phase k. Thus, the value 
of p during the computation will take the following values in 
successive time steps: 1, ~, 1, |, |, 1, |, |, |, 1, ~, ~, |, ^, . . . 
(where the underlinings indicate the phases). We ran this 
version of the algorithm on random networks with different 
numbers of nodes, where each edge is present with proba- 
bility 1/2. We found that the mean number of time steps 
required to complete the algorithm and choose a maximal in- 
dependent set in these networks was close to the exact value 
of log 2 n, where n is the number of nodes (and the logarithm 
is to base 2). These experimental results are shown as the 
upper line of data points in Figure [3] 

2. RESULTS 

In this paper we investigate whether the simple algorithmic 
approach proposed by Afek et al. can be improved to make 
it competitive with the standard approaches in the number 
of steps required. Our first analytical result gives a neg- 
ative answer to this question by showing that computing 
a maximal independent set using this algorithmic approach 
will require more than some constant multiple of log 2 n time 
steps on some families of networks with n nodes whatever 
sequence of global probability values is used (see Theorem 1). 

Does this mean that the simple mechanism used by biolog- 
ical cells for "fine-grained" pattern formation has an inner- 
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Figure 3: Actual performance of the computation of the MIS on random networks where each edge is present 
with probability 1/2. The upper points (black) show the mean number of time steps taken by the algorithm 
over 100 trials with global sweeping probabilities as specified in 111. The lower points (blue) the same for 
locally chosen probabilities with feedback. Error bars indicate standard deviations over 100 trials in each 
case. The upper dashed line shows the value of log 2 n and the lower dotted line shows the value of 2.5 log n for 
comparison (all logarithms to base 2). 



1. Set local value of p = 1/2; 

2. while active, at each time step do 



s. FIRST EXCHANGE 

4. With probability p, start signalling to all neighbours; 

5. if any neighbour is signalling then 

6. Stop signalling (if started); 

7. Reduce p 
s else 

9. Increase p (up to a maximum of 1/2) 

10. SECOND EXCHANGE 

11. if signalling then 

12. Join the MIS; 

13. Terminate (become inactive) 

14. else if any neighbour is signalling then 
is. Terminate (become inactive) 



Table 1: The algorithm at each node 



neighbour 
has joined MIS 




neighbours 
not signalling 



Figure 2: Abstract state-based description of the 
process at each node. At each step a node may signal 
that it wishes to join the independent set by moving 
to the state at the top right with probability p (which 
varies with time). It then responds to the signals 
from neighbouring cells. 



ently lower efficiency compared with carefully engineered al- 
gorithms such as Luby's algorithm? To answer that question 
we looked more closely at the mechanism of "fine-grained" 
pattern formation during cell development. 
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Figure 4: Notch-Delta signalling constructing a pos- 
itive feedback to amplify small differences between 
cells. A slight excess of Delta in cell 2 enables the 
positive feedback loop that leads to mutually exclu- 
sive signalling states of the two cells. 



which generates an ultrasensitive switch between two mu- 
tually exclusive signalling states: sending (high Delta/low 
Notch) and receiving (high Notch/low Delta). A slight ex- 
cess of Delta production in one cell can generate a strong 
signalling bias in one direction: the cell becomes a sender 
and its neighbours become receivers [21] . At the multicel- 
lular level, this lateral inhibition mechanism can break the 
symmetry among cells and amplify small differences between 
neighbouring cells, thus facilitating pattern formation (see 
Figure [4]) . 

It is clear that the cells participating in this developmental 
process do not act autonomously - they continuously ad- 
just their behaviour based on the signals being made by the 
cells around them. We abstracted from the positive feed- 
back mechanism of Notch-Delta signalling to construct the 
distributed algorithm described in Table [I] It uses similar 
processing at each node to the algorithm of Afek et al, but 
each node now has its own independently updated probabil- 
ity value. These values all start at 1/2, but are decreased by 
some fixed factor whenever one or more neighbouring cells 
signal that they wish to join the independent set. They are 
also increased by the same factor (up to a maximum of 1/2) 
whenever no neighbouring cell issues such a signal. 

Our main result shows that choosing the probabilities in 
this way, using a simple local feedback mechanism, gives 
an algorithm whose expected time to compute a maximal 
independent set is only O(logn) (see Corollary [5} . Hence 
this new simple algorithm performs as well as all previous 
algorithms for this problem, and, indeed, as well as the the- 
oretical optimal performance. 

To illustrate this result we implemented the new algorithm 
with the probability p at each node varying as follows: p 
is initially set to 1/2. At any time step where there is a 
signal from at least one neighbouring cell the value of p is 
halved. At all other time steps it is doubled (up to a max- 
imum of 1/2). We then ran this new algorithm on random 
networks with different numbers of nodes, where each edge 
is present with probability 1/2. We found that the mean 
number of rounds required to complete the algorithm and 
choose a maximal independent set in these networks was 
now close to 2.5 log n, where n is the number of nodes (and 
the logarithm is again to base 2) . These experimental results 
are shown as the lower line of data points in Figure [3] 



The Notch-Delta signalling pathway provides a communi- 
cation channel between neighbouring cells during develop- 
ment. It is thought to play a critical role in the formation 
of "fine-grained" patterns in the development of many or- 
ganisms 21 , 9 , helping to generate distinct cell fates among 
groups of initially equivalent neighbouring cells. In partic- 
ular, many studies have shown that Notch-Delta signalling 
regulates the selection of Drosophila neural precursors from 
groups of equipotent proneural cells in a way which resem- 
bles the MIS selection problem [5] [4j [21] . The transmem- 
brane protein Delta has been shown to have two activities: 
Delta in one cell can bind to, and transactivate, the trans- 
membrane protein Notch in its neighbouring cells [5]; Delta 
and Notch in the same cell mutually inactivate each other 
TT ~ 



[17 . The interaction of the transmembrane proteins 



Notch and Delta constitutes a positive feedback mechanism 



3. LOWER BOUND FOR GLOBALLY 
CHOSEN PROBABILITY VALUES 

Following 1 , we refer to the signalling at each node as 
"beeping". At each time step, each node chooses to beep with 
a certain probability and this beep is immediately heard by 
each of its neighbours. 

In this section we consider the class of algorithms described 
in [2] where each node runs through a preset sequence pi , p2 , . . 
of probabilities for beeping. We assume that all nodes beep 
with probability pi in the ith time step. Our first result gives 
an explicit family of graphs with 0(n) vertices, for which any 
such algorithm takes at least a fixed multiple of log 2 n steps. 
(We generally omit floors and ceilings for clarity.) 



Theorem 1. There is a constant a > such that the fol- 
lowing holds. Let G be the graph consisting of n 1 ^ 3 disjoint 
copies of the complete graph Kd, for each d — 1, . . . , n 1 / 3 . 
Let T = a log 2 n and let p±, . . . ,pt be any sequence of prob- 
abilities. Then with high probability, the algorithm using the 
probability sequence pi, . . . ,pt does not terminate within T 
steps. 

Proof. Fix d, and consider a copy K of Kd. The prob- 
ability that some vertex of K is added to the independent 
set at the zth step is the probability that exactly one vertex 
of K beeps, and so equals 

dp^l-pi)^ 1 < dpiexp(-(d - l)pi). (1) 

Note that the function xe~ x is bounded on [0, oo), and has 
maximum 1/e (at x = 1). So for d > 2, 

dpiexp(-(d-l)pi) j^-j-(d-l)piexp(-(d-l)pi) < 

Also, for x G [0,3/2e], we have 1 — x > exp(— 2x). So, by 
inequality ^ the probability that all the vertices of K are 
still active after T steps is at least 
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> exp(— 6dpje~ dpi ) 



The last inequality follows from the fact that e Pi < e < 3. 

Hence if J2f=i §dpie~ dpi < \ logn then the nodes of K re- 
main active with probability at least n _1//4 . In that case the 
probability that the nodes in all the copies of Kd become 
inactive is at most 

/-, -lMxn 1 / 3 ^ / 1/12n 

(1 — n ' ) < exp(— n ' J, 

and so with high probability (i.e, tending to 1 as n oo) 
the algorithm fails to terminate. 

It follows that we may assume that YlJ=i 6dpie~ dpi > | log n 
for every choice of d > 3. We will show that this implies 
T = ft(log 2 n). 

Let us choose d at random. We define a probability distri- 
bution for d by 



r logn 



for r = 3, . . . , n 1 (where c is a normalizing constant: note 

1/3 

that c = 6(1), as J^ILi l/r = G(logn)). Then, for any 

pe[o,i], 
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But J2^L P e ~ rp = P/i 1 ~ e ~ P ) < 2, as p G [0, 1]; so we have 
K[dpe~ dp ] < 2c/ logn. By linearity of expectation, choos- 
ing a random d, we have EE i=1 6dpie~ dpi ] < 12cT/logn. 



Hence there is some value of d for which Yli=i §dpie~ dpi < 
12cT/logn. By the argument above, this quantity must be 
at least | logn, and so we must have T = Q(log 2 n). □ 

4. UPPER BOUND FOR LOCALLY 
CHOSEN PROBABILITY VALUES 

In this section we consider the new class of algorithms de- 
scribed in Table ^ where each node maintains its own inde- 
pendent probability value which varies over time. 



Definition 1. We define the following distributed algorithm 
for computing a maximal independent set in an arbitrary 
graph. 

At each time step t, there is an integer n(v,t) attached to 
each vertex v, and v beeps with probability 2~ n(yt,v \ We set 
n(0,v) = 1 for every v. 

At each time step, we update according to the following local 
rules: 



• If v beeps and no neighbour of v beeps, then v is added 
to the independent set and becomes inactive (along 
with its neighbours). 

• If some neighbour w of v beeps, and no neighbour of w 
beeps, then v becomes inactive (as w is added to the 
independent set). 

• If v does not beep and no neighbour of v beeps, we set 
n(t + 1, v) = max{n(t, v) — 1, 1}. 

• If some neighbour of v beeps, but no neighbour is 
added to the independent set, we set n(t + l,v) = 
n(t,v) + l. 

It follows from the analysis of [2] that if this algorithm ter- 
minates (i.e., all nodes become inactive) then it correctly 
identifies an MIS. The only question is the number of time 
steps required. 



Note that, unlike Luby's algorithm [3j[16j, it is not true that 
at every time step we can expect at least some constant frac- 
tion of the edges to be incident to nodes that become inactive 
at that step. For example, in a complete graph nodes will 
only become inactive when exactly one node beeps. The 
probability of this happening at the first step is only n/2 n , 
so at the first step the expected number of edges that are in- 
cident to nodes that become inactive is only n 3 /2 n+1 . Hence 
we must carry out a more detailed analysis over a sequence 
of time steps. 



Theorem 2. There is a constant Ko such that the fol- 
lowing holds: For any graph G with n vertices, and any 
k > 1, the algorithm defined above terminates in at most 
Ko(k + 1) logn steps, with probability at least 1 — e, where e 
isO(l/n k ). 



Before beginning the proof of Theorem [2] it will be useful 
to define some notation and record a few simple facts. 



Let us define a measure /it(-) on V — V(G) by setting 
/J>t(v) — ¥[v beeps at time t] — 2~ n(<t,v \ 

(By convention, we set fJbt(v) = if v is inactive at time 
t; this simplifies notation, while allowing us to ignore the 
contribution of inactive vertices.) For any S C V we write 
/xt(5) for TtvesMv)- 

We will frequently use the following inequality, which holds 
for 5 e [0,1]: 

1-5 < exp(-<5) < 1 - V 2 - ( 2 ) 

We will also need the following Chernoff-type inequality: if 
X is a sum of Bernoulli random variables, and KX — fi, then 
for every S > 0, 

F[X > fi + S] < exp(-£ 2 /(2/x + 2<5/3)). 

In particular, 

P[X > 2/i] < exp(-/i/3). (3) 
We will also need the following simple bounds. 

Proposition 3. For any set S of vertices, 

exp(— 2/it(S0) < f[no vertex in S beeps at time t] 
< exp(- Mf (5)). 

PROOF. By inequality the probability that no vertex 
in S beeps at time t is 

- **(«)) < If = exp(- Mt (S)). 

v(ES v(ES 

On the other hand, 

fi t (v)) > H exp(-2fi t (v)) = exp(-2/x t (S)), 

where the last inequality used |2]) and the fact that \i t (x) < 
1/2 for every x and t. □ 

The set of vertices adjacent to a given vertex x will be called 
the set of neighbours of x, and denoted T(x). We will say 
that a vertex x at time t is X-light if /^(r(x)) < A, that is, 
if the total weight of its neighbours is not too large (and so 
it is not too likely to hear a beep at time t). Otherwise we 
say that x is X-heavy. Note that a fixed vertex may move 
back and forth between being heavy and light over time. 

Our first result shows that a light vertex is quite likely to be 
added to the independent set if it beeps. 

Lemma 4. Let S be a set of X-light vertices at time t. 
Then the probability that some vertex in S is added to the 
independent set at time t is at least e _2A (l — e~^ t(yS ^). 

Proof. Let us order the vertices of S as si, . . . , s r . Then 
the probability that some vertex of S is added to the in- 
dependent set at time t is at least the probability that the 



earliest vertex of S that beeps is added to the independent 
set. For i— 1, . . . , r, define events Ei and Fi by 

Ei = (si beeps; si, . . . , Si-i do not beep) 

Fi = (no neighbour of Si beeps). 

The events E{ D Fi are pairwise disjoint, so the probabil- 
ity that the earliest vertex of S that beeps is added to the 
independent set is 

r r r 

n\j{Ei nfi)] = Y.^ Ei nFi i = X^t^M^i^]- 

i—l i—1 i—1 

It is easily seen that P[Fi|£?i] > P[Fi], and so by Proposi- 
tion |3] we have 

F[Fi\Ei] > F[Fi] > exp(-2/xt(r(s»))) > exp(-2A), 

as Si is A- light, and so 

r r 

^2¥[Ei]¥[Fi\Ei] >exp(-2A)^P[£i]. 

i=l i=l 

But 21=1 ^[Ei] is simply the probability that some vertex in 
S beeps, and so by Proposition [3] is at least 1 — exp(— /j, t (S)). 
Thus the probability that some vertex of S is added to the 
independent set at time t is at least 

r 

exp(-2A)^P[£i] > e~ 2A (l -e~^ {s) ). 

i=l 

□ 

Proof of Theorem [2] Fix a vertex v. Let K = 10 11 
and set K = K (k + 1). Let a = 10~ 3 , (3 = 1/50 and A = 7. 

We shall show that, with failure probability 0(l/n fe+1 ), v 
becomes inactive within if log n = (k + 1) • i^ologn steps. 
Taking a union bound over all n choices of v, it follows that 
with failure probability 0(l/n k ) every vertex becomes in- 
active and the algorithm terminates within K log n steps, 
which proves the theorem. 

For t > 1, let 

L t = L t (v) - {x e T(v) : fJtt(T(x)) < X} 
H t = H t (v) = {x e T(v) : fJLt(T(x)) > A}. 

Thus L t (v) U H t (v) partitions the neighbourhood of v into 
light and heavy vertices. We will follow the behaviour of 
fjLt(Lt) and fit(Ht) over time. 

The idea of the argument is roughly as follows: if Ht(Lt) is 
large at many time steps, then by Lemma ^] it is very likely 
that some neighbour of v will be added to the independent 
set on one of these occasions, leading to v becoming inactive. 
If this does not happen, then fMt(L t ) will be small most of the 
time, and we can concentrate on Ht. Now vertices that are 
heavy at time t are likely to hear beeps and so drop in weight 
(as their beeping probability drops); it will follow that with 
high probability /j,t+i(H t ) is a constant factor smaller than 
/jLt(Ht) most of the time. Now we look at the evolution 
of fit(T(v)): it may be large and increasing for some small 
fraction of the time, but mostly it is either shrinking or 
else it is already small. It will follow that most of the time 



fit(r(v)) is small. But then most of the time v will not hear 
beeps. Since this also implies that ^t(v) will be large most 
of the time, it is very likely that v will beep and not hear 
any beeps, and so get added to the independent set. 

At each time step t, we consider the following four possible 
events: 

(El) /Jt(Lt) > a [ l T(v) has a significant weight of light 
neighbours'] 

(E2) fJLt(Lt) < a and /i t (r(v)) < ft [v is very light'] 

(E3) fjLt(L t ) < a, fjL t (T(v)) > /3 and /i t+1 Q») < ^/it(I») 
['the neighbourhood of v shrinks significantly in weight 
during step £'] 

(E4) fjL t (Lt) < a, fjL t (T(v)) > and /i t+ i(I») > ^/i t (I») 
['the neighbourhood of v does not shrink significantly 
in weight during step t (and may grow)'] 

Exactly one of these events must occur at each time step. 
Note that we know whether (El) or (E2) occur at the begin- 
ning of the time step; if neither occurs, then we must look 
at the beeps to determine which of (E3) and (E4) occurs. 

We organize the proof as a series of claims. 

Claim 1. With failure probability 0(l/n k+1 ), (El) oc- 
curs at most (if log n)/40 times in the first Klogn time 
steps. 

Each time that (El) occurs, it follows from Lemma [4] that 
some vertex of Lt is added to the independent set (and so 
v becomes inactive and the process at v terminates) with 
probability at least e~ 2A (l - e ~^ Lt) ) > e~ 2A (l - e~ a ). 
Let a = e _2A (l — e~ a ): the probability that (El) occurs 
(if logn) /40 times without v becoming inactive is at most 
(1 -ci) (Klogn)/40 < exp(-(ciK /40)(fc + l)logn). By our 
choice of ifo, we have ifo > 40/ci, so this probability is less 
than exp( — [k + 1) logn) = n~^ k+1 \ This proves the claim. 

The bad event for us will be (E4), so let us bound the prob- 
ability that (E4) occurs. 

Claim 2. At each time step t, the probability that (E4) 
occurs is at most 1/80. 

If (E4) can occur, then we must have fit(L t ) < a and 
fit(T(v)) > f3. If x G Ht then the probability that no 
neighbour of x beeps at time t is at most exp(— \i t (r(x)) < 
exp(— A). Let H® be the set of vertices in H t that do not hear 
a beep at time t, and let H\ = H t \ H?. Then E[/i t (if t )] < 
exp(— \)/jLt(H t ), and so by Markov's inequality 

P[M#t°) > 80exp(-A)/it(# t )] < 1/80. (4) 

Now all vertices in H\ must halve their weight at the next 
time step, while vertices in Lt and H® may either halve or 



double their weight (additionally, some weights may get set 
to if vertices become inactive). So 

< \nt{Hl) + 2MH?) + 2fH(L t ) 

= \nt(r(v)) + \nt(H°) + ^ t (Lt) 

It follows from Q that, with probability at least 79/80, 

/*+i(r(«)) < \»t{T{v)) + ^80e- X fi t (H t ) + ^ f (L f ) 

where the final inequality follows from our choices of a, /3 
and A, as Ht(L t ) < a and fit(T(v)) > /3. Thus the prob- 
ability that (E4) occurs is at most 1/80. This proves the 
claim. 

Claim 3. With failure probability 0(l/n k+1 ), (E4) oc- 
curs at most (if logn) /40 times in the first Klogn time 
steps. 

At each step, the probability of (E4) depends on the past 
history of the process. However, by Claim [2] it is always at 
most 1/80, and so we can couple occurrences of (E4) with 
a sequence of independent events each occurring with prob- 
ability 1/80. It follows that the number of occurrences of 
(E4) in the first if logn time steps is stochastically dom- 
inated by a binomial random variable X with parameters 
if logn and 1/80. The probability that (E4) occurs more 
than (K logn)/40 times is therefore, by (|3|, at most 

P[X > 2EX] < exp(-EX/3) < exp(-(if logn)/240) 

which is 0(n - ( fc+1 )), proving the claim. 

From Claim [l] and Claim [3] we conclude that with failure 
probability 0(n~ (fe+1) ), (El) and (E4) altogether occur at 
most K (logn)/20 times in the first if logn time steps. We 
next show that, with small failure probability, fjbt(T(v)) is 
small most of the time. 

Claim 4. With failure probability 0(1/ n k+1 ), fit (T(v)) > 
2/3 for at most (if logn) /6 time steps in the first Klogn 
time steps. 

Let T be the set of times t > 1 at which /j,t(r(v)) > 2/3. 
We decompose T into (maximal) intervals of integers, say 
as Ti U • • • U T r . Let Ti = [si,U] be one of these intervals. 
We colour each integer t £ Ti red if (El) or (E4) occurred 
at the previous step, and blue if (E3) occurred (note that 
(E2) cannot occur, as fi t -i(T(v)) > fi t {T(v))/2 > /3). Let 
ri be the number of red elements and b% the number of blue 
elements. We have /jbt(T(v)) < \±t-\ (T(v))/V2 at blue steps, 
and fit(T(v)) < 2fit-i(T(v)) otherwise. So 

< M^-i(I») • 2 r */V2 h < /x s ,-i(r») • 2 r *"i\ 
Since ^ ti (T(v)) > 2/3 it follows that 

n > + log 2 2/3 - log 2 0u Si _i(I>))). 



However, /j JSi -i(T(v)) < 2/3 in all cases where Si > 1, and 
/jbo(r(v)) < \n. Summing over i, we see that the total num- 
ber of red elements in T is greater than half the total num- 
ber of blue elements in T plus log 2 2/3 minus log 2 \n. But 
red steps correspond to events (El) and (E4), which alto- 
gether occur at most (K log n)/20 times in the first K log n 
time steps, so the total number of elements in T is less than 
(K\ogn)/6. This proves the claim. 



Claim 5. With failure probability 0(l/n k+1 ), v hears a 
beep at most (K\ogn)/3 times among the first Klogn time 
steps 

From the previous claim, we may assume that fjLt(T(v)) < 2/3 
for at least (5/6)i^logn steps out of the first i^logn time 
steps. If fjb t (T(v)) < 2/3, it follows from Proposition [3] that 
the probability that v hears no beeps is at least 

exp(-2/x t (I»)) > exp(-4/3) > 1 - 40, 

and so v hears a beep with probability at most 4/3 < 1/12. 
Then ^ implies that with failure probability 0(n~ (fc+1) ) 
there are at most (K log n)/6 steps among the first K log n 
at which /jL t (r(v)) < 2/3 and v hears a beep. Since there are 
at most (K log n)/6 steps at which fit(T(v)) > 2/3, it follows 
that, with failure probability 0(n~^ k+1 ^), v hears beeps at 
most (Klogn) / 3 times among the first if log n time steps, 
proving the claim. 



Claim 6. With failure probability 0(1/ n k+1 ), either v be- 
comes inactive or fit (v) = /j, t +i(v) = 1/2 at least (K logn)/3 
times in the first K log n time steps. 

Suppose that /j, t +i(v) — ii t (v)/2 on a occasions, fjL t +i(v) = 
2fjbt(v) on b occasions and /j, t +i(v) = fJ>t(v) = 1/2 on c occa- 
sions. We know that a < (K logn)/3, by the previous claim, 
and b < a (as /jLo(v) = 1/2, and fit(v) is bounded above by 
1/2), so either v becomes inactive in the first K log n time 
steps, or else we must have c > (K logn)/3. 

We can now complete the proof of Theorem[2] If /J,t(v) = 1/2 
and v hears no beeps at time t then v becomes inactive with 
probability 1/2. The probability that v remains active for 
at least (K\ogn)/3 such steps is at most (1 /2)( K1 °s™)/ 3 = 
0(n _ ( fc+1 )). On the other hand, by the claim above, with 
failure probability 0(l/n k+1 ), either v becomes inactive or 
there are at least K(\ogn)/3 steps in the first K log n at 
which iit(v) — Ht+i(v) — 1/2, and so in particular fit(v) = 
1/2 and v hears no beeps. We conclude that v becomes 
inactive with failure probability 0(n~^ k+1 ^). □ 

Corollary 5. The expected number of steps taken by 
the algorithm in Definition [7] on any graph with n nodes 
is O(logn). 

Proof. Let T be the total number of steps taken by the 
algorithm and let T' = \T/(Ko\ogn)], where Ko is the 
constant identified in Theorem |2j 



By Theorem[2] we have that, for any k > 1, P[T' > k + 1] < 
c/n k for some constant c. Hence P[T X = k + 2] < c/n k , so 
E[T r ] < 1 + 2 + ^2 ( ^ 3 ic/n' l ~ 2 = c, for some constant c. 
Hence E[T] is 0(logn). □ 

Theorem [2] and Corollary |5] establish upper bounds on the 
running time that are logarithmic in the number of nodes. 
Our simulations show that in practice the constants are 
rather low, leading to an efficient practical algorithm (see, 
for example, Figure [3]). 

5. BIT COMPLEXITY 

We have shown that the expected number of time steps to 
complete the execution of the randomised algorithm defined 
in Section [4] grows at most logarithmically with the number 
of nodes. Another important resource to be considered in 
any distributed algorithm is the total number of messages 
sent (the message complexity), and their total size (in bits), 
which is sometimes referred to as the bit complexity. In 
this section we will show that the expected number of times 
that each node beeps is bounded by a constant. Hence the 
expected bit complexity per channel for this algorithm does 
not increase at all with the number of nodes. 

Theorem 6. The expected total number of beeps emitted 
by any node executing the algorithm in Definition^is 0(1). 

Proof. Let v be a node executing the algorithm in Defi- 
nition [l] and consider the whole sequence of time steps until 
v becomes inactive. At each step, the probability that v 
beeps is 2~ n ^ v,t \ which we will denote by p t . The initial 
value of pt is | . 

Consider first the subsequence of steps where v hears a beep 
from its neighbours, halves its own probability of beeping, 
and reaches a value p t that is its lowest value so far. The 
expected number of times that v beeps during this subse- 
quence of steps is| + | + | + -- -<l. 

At all other time steps, one of the following 3 things happens: 

Case 1 v hears no beep from its neighbours, and so doubles 
its probability of beeping (from pt to 2pt) 

Case 2 v hears a beep from its neighbours, and so halves 
its probability of beeping (from p t to \pt) 

Case 3 v hears no beep from its neighbours, and so leaves 
its probability unchanged at the maximum value of | . 

Now we consider how many times v beeps during these steps. 

If v beeps at any step where Case 3 holds, then it becomes 
inactive, so the total number of beeps at such steps is at 
most one. 

For each step t where Case 1 holds, and the probability in- 
creases from p t to 2p t , we look for a corresponding step t' > t 
when the probability next drops back to pt. (If there is no 
such step, because v becomes inactive before the probabil- 
ity returns to pt, then we simply add a dummy step t' to 
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Figure 5: Actual performance of the computation of the MIS on random networks where each edge is present 
with probability 1/2. The upper points (black) show the mean number of beeps at each node over 200 trials 
with global sweeping probabilities as specified in [1]. The lower points (blue) the same for locally chosen 
probabilities with feedback. Error bars indicate standard deviations over 200 trials in each case. 



the end of the sequence.) Note that each step where Case 2 
holds is now paired with an earlier step where Case 1 holds. 

Now consider the pairs (£, £'), and define B t to be the event 
that v beeps at either time t or time t' (or both). The total 
number of times that v beeps at steps where Case 1 or Case 
2 holds is at most twice the number of times that events of 
the form Bt occur. 

If B t occurs, then the probability that v beeps at time t 
is p t /(pt(l - 2p t ) + 2p t (l - Pt) + 2p 2 t) > |. However, if v 
beeps at time t, then it is added to the independent set, and 
becomes inactive. Hence the expected number of events Bt 
that occur before v becomes inactive is less than 3. 

We have shown that the expected number of times that v 
beeps is less than 1 + 1 + 2x3, which proves the result. □ 



Our simulations show that in practice the mean number of 
times that each node beeps is very low and does not increase 
with the size of the graph. For example, for random graphs 
with edge probability |, and for rectangular grid graphs it 
is around 1.1 (see Figure [5]). 

Afek et al. do not discuss the expected number of beeps 
at each node in their algorithm with sweeping global prob- 
abilities [l]. Our experimental results for this algorithm on 
random graphs with edge probability \ are shown in Fig- 
ure [5] and appear to increase with the size of the network. 
However, when the initial global probabilities are calculated 
from the overall network size, and gradually increased, as 



described in 12], the mean number of beeps at each node 
does appear to be bounded by a constant. This is consistent 
with the claim made in 2 that the message complexity for 
this version of the algorithm is optimal. 

6. CONCLUSION 

In conclusion, we have constructed a simple randomised al- 
gorithmic solution to a fundamental distributed computing 
problem - the maximal independent set selection problem, 
inspired by the intercellular signalling mechanism used for 
"fine-grained" pattern formation in many biological organ- 
isms. Our algorithm has optimal expected time complexity, 
and optimal expected message complexity. To achieve this 
good performance it is necessary to exploit local feedback - 
we have shown analytically that it cannot be made so time 
efficient if the processors do not adapt their probability of 
signalling to their local environment. 

Our algorithm uses simple identical processors and one-bit 
messages, and its expected running time grows only loga- 
rithmically with the number of nodes, and is therefore much 
lower than the time required by a centralised sequential al- 
gorithm. Moreover, the individual processors need no infor- 
mation about the global properties of the network, such as 
its size, and do not need to identify which neighbours sent 
which message. These features make the algorithm useful 
for many applications, such as ad hoc sensor networks and 
wireless communication systems. Selecting a maximal in- 
dependent set can also be used as a fundamental building 
block in algorithms for many other problems in distributed 
computing. 



We also note that the algorithm we have proposed here is 
highly robust, in the sense that it retains its good perfor- 
mance even when various features are changed. For exam- 
ple, the probabilities at each node do not need to increase 
and decrease by a precise factor - the analysis we have given 
here can be adapted to a wide range of different values for 
these factors, which may vary between nodes and over time. 
Similarly, the initial values for the probabilities at each node 
may be different from |, and may vary from node to node, 
without any significant impact on performance (as long as 
sufficiently many of them are bounded away from zero) . This 
robustness is likely to be a key feature of the algorithm in 
any biological context. 

The relationship between "fine-grained" pattern formation 
and the maximal independent set selection problem that we 
have investigated here is therefore a strikingly successful ex- 
ample of the increasing convergence between systems biology 
and computational thinking |2Q| . 
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